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The accessible information of general signal states is obtained by performing a generalized mea- 
surement. In the case that the signal alphabet consists of two states of a qubit system, it is proved 
that a von Neumann (orthogonal) measurement is sufficient to reach the accessible information. 
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Mutual information is the fundamental quantity of in- 
formation that two parties, Alice and Bob, share. As- 
sume that Alice encodes a message by sending a spe- 
cific quantum state p r for each letter in the alphabet 
of the message. The rth letter in the alphabet occurs 
with probability tr(p r ) in the message. Bob sets up a 
measurement apparatus to determine which state was 
sent, described by an n-outcome positive operator valued 
measure (POVM) in a d-dimensional space, the range of 

r 

n 

^IL, =I dx d, n, > Vj < n. (1) 

4=1 

This gives a joint probability matrix 

p rJ = tl' (p r Iij) (2) 

which denotes the probability of Bob to receive outcome 
j and Alice to send I. We denote the marginals of the 
distribution by 

Pr- = J^Prj, P-j = ^Prj- (3) 

3 T 

The mutual information 

/(,,n) = £ Pr >g(/^) (4) 

tells us how much information on average is transmit- 
ted to Bob per sent state [T]. Choosing a POVM that 
maximizes the mutual information defines an important 
quantity; this is called the accessible information 

4cc = max /(p, IT). (5) 

The question which POVM maximizes the mutual in- 
formation, which was raised by Holevo in 1973 [2], is 
generally unanswered and usually addressed numerically 
[3J 31 [S]. Even the simpler question of how many out- 
comes are sufficient is unanswered. It has been shown [B] 
that an orthogonal (von Neumann) measurement, i.e. 

UiUk = 8 lk U k , tr{n fe } = l, (6) 



is in general not sufficient. Levitin [7] conjectured in 1995 
that if Alice's alphabet consists of up to d states, an or- 
thogonal measurement is sufficient. If so, the number of 
outcomes would be equal to the dimension of the Hilbcrt 
space. This conjecture has been shown not to hold in 
general by Shor [5]. In the same paper Shor reported 
that Fuchs and Perez affirmed numerically that if the al- 
phabet consists of two states the optimal measurement is 
an orthogonal measurement. This is the orthogonal mea- 
surement conjecture. For two pure states it was proved 
to be true in arbitrary dimensions by Levitin [7j. 

This conjecture has important experimental and theo- 
retical implications. In an experiment, orthogonal mea- 
surements are generally easier to implement than ar- 
bitrary generalized measurements. From a theoretical 
point, knowing the accessible information is crucial to 
determine the C^-channel capacity [3] and for security 
analysis using the Csiszar-Korner theorem |10j , for exam- 
ple the thresholds for a collective attack on the Singapore 
protocol |llj are obtained by determining the accessible 
information. Work has been done under the assumption 
that this conjecture is true [12]. In the sequel we will 
prove this conjecture for qubits. 

Label the two states of the qubit system by p\ and 
P2- It is always possible to find a basis in which their 
matrix representation is real. Hence, we can conclude 
from Sasaki et al. |13j in an extension of the proof by 
Davies |14j that the maximum number of rank-1 out- 
comes needed to reach the accessible information is three. 
In the same paper Sasaki showed that these outcomes can 
be chosen to be real as well. The outcomes can be written 
II j = \j)(j\ where in general (j\j) < 1. What we will 
show in the following is that two of these vectors have to 
be linearly dependent if the mutual information is max- 
imal. The most general variation of these outcomes is 
given by 

5 \j) = i Yl l TO > e ™J> 4j = £ jm (7) 

m 

with e representing an arbitrary hermitian three by three 
matrix. With this variation, the first order variation of 



2 



the mutual information is given by 



SI = "J^Sprjlog 



Prj 



-2E 



lm((j\p r \m)e m j)\o\ 



For each pair of outcomes (k, I) we can set e m j for {k, 1} 
{m,j} to zero, except for e^l — § an d e (fc = — §• This 
gives us the following set of variations 

](k\p r \l) lo> 
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E< 



Prfc P-l 



(8) 



Since these sets are antisymmetric in fc, Z we get exactly 
three independent pairs. Fix one of the directions, say 
|1), and one vector |0) orthonormal to |1) to complete 
a basis of our real Hilbert space. Any vector \n) can be 
expressed as 



\n)=p (n) |0)+/3 1 (n)|l) 



(9) 



We want to see what are the restrictions from these equa- 
tions ([8| on the vectors. When /3o is zero the conjecture 
is trivially true, since the vector would be proportional to 
|1). Observe that, due to scale invariance of it is al- 
ways possible to divide out (3$ ^ and restrict ourselves 
to \n) = |0) + t 11) with t an arbitrary real number. 

In the following we assume neither p\ nor pi to be 
pure. We will outline how to approach this case at the 
end of the paper. We get: 
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0=<1|G»1 + Pa)|l» OrtfrWlog 



r=l 



Qr(t) 

aiQi(t)+aaQ 2 (t) 



(10) 



with 



0|Pr|0) 

(1| Pr 11)^(11 Pp |l) ' 



(l|Pr|l) 



(1| (J>l+P») |1) 
(11) 

and prime denoting differentiation with respect to t. In- 
troducing 



(l|pr|0) 
(l|Pr|l)' 



Vr 



(0\pr\0) 
(l|Pr|l) 



(12) 



the range for these variables is restricted due to positivity 
of the states to 

< £ r < r] r < oo, < a r < 1, r = 1, 2, ai + aa = 1. 

(13) 

The following theorem is the cornerstone for handling 
the general case: 

Theorem 1. Each function defined by 

Qr(t) 
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f(a,S,r,)(t) = a r Q' r (t) 10; 



r=l 



aiQi(t) + a 2 Q 2 (t) 



(14) 



wii/i constraints given by (13), Q r (t) — t +2t^ r + rj r and 



]' r (t) = 2{t + £ r ), has at most two real roots. 



We will prove this later; first, we use this theorem to 
show our central result: 

Theorem 2. Every local maximum of the mutual infor- 
mation is a von Neumann measurement. 

Proof. Assume that the mutual information is station- 
ary and that POVM is not von Neumann. Observe that 



in ( 14 1 if one logarithm is zero, automatically the other 
is zero as well. Since |2) and |3) have to be distinct, 
Theorem [TJ tells us that one of these states must set the 
logarithm to zero, say |3). This means that 



P-1P13 



P21 



1 <-> 

P-3PU Pll 



P23 
Pl3' 



so outcome 1 and outcome 3 are equivalent. Since the 
same reasoning is applicable to (2| instead of (1| in equa- 
tions ([8|,( 10 1 we find that all outcomes are equivalent and 
we arc in a minimum. □ 
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FIG. 1: Illustrating the function fit) in Theorem |T]) for se- 
lected values, a 1 —02 = 0.5, £2=0 and 772 = 1, other 
parameters as stated in the graph. 



The remainder of this paper will be used to show some 
lemmata to proof Theorem [TJ The graph of / is plotted 
in FigjTJfor selected values of the parameters to illustrate 
its shape. Using Landau little-o notation and expansion 



of the logarithm in (14 1 shows that asymptotically 



f(t)=Q + o(l/t). 



(15) 



In particular, this implies that the function vanishes at 
infinity. Fortunately the term in front of the logarithm is 
linear, so by calculating the second derivative we arrive 
at a rational function. Introducing the abbreviations 



L = Q'xQi — Q' 2 Qu Qs = aiQi + a 2 Q2, 



(16) 
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for the first and second derivative we get 




(gjgg - Q' 2 Qi) ; 



f" = aia 2 



L 



(QiQ 2 Q s ) 2 



with 



P = 3L'(QxQ 2 Q s ) - (QiQ 2 Qs)'L. 



(17) 



(18) 



The second derivative of / is a product of a rational func- 
tion with a maximum of two zeros and no poles - since L 
is of second order and Q1Q2QS strictly positive - and a 
sixth-order polynomial P. In the following we will show 
that the number of real roots of P is limited to two, 
which implies that the number of inflection points of / 
is limited by four. Since / approaches zero at infinity, it 
follows that the number of real roots of / is at maximum 
two, completing the proof of Theorem 1. 

Since the number of roots is translation and scaling 
invariant, it is always possible to set £2 = and r\% = 1. 
It will be convenient in the following to label £1 simply 
as £ and to introduce the difference between r\x and £f, 



X 



(19) 



which is positive because of ( 13 ). Since a direct attack on 



this family of polynomials does not prove successful, we 
will look at its discriminant, showing that it is non-zero 
which implies that on the whole parameter domain no 
double root occurs. Since the coefficients (and therefore 
the roots) of our set of polynomials depend continuously 
on the parameters, any change in the number of roots 
occurs either by going through a double root or changing 
the degree of the polynomial. In particular we will use 
the following 

Theorem 3. Let G be a family of real valued polynomials 
with formal degree n, i.e. 



G 



G z 



k=0 



g{z) k x k 



which is continuous in the coefficients of x, and D a con- 
nected domain of G for which the discriminant of G does 
not vanish. The number of roots of G z is constant on 
each of the following domains 

D = {zeR m \g(z) n = 0}, 
Dx = D\D , 

and the number of roots on D\ is the number of roots on 
D plus one. 

Proof. Assume that the discriminant does not vanish. 
This implies that the polynomials on D\ do not have 



any double root. Since the roots of the polynomials de- 
pend continuously on its coefficients this implies that the 
polynomials of each connected component of D\ have the 
same number of real roots. The non-vanishing of the dis- 
criminant on Dq, where the highest coefficient vanishes, 
implies that the second highest coefficient is non-zero and 
that no double root occurs as well. This fixes the number 
of real roots on each connected component of Do as well, 
and there is exactly one less which went off to infinity. □ 

Corollary 1. The discriminant A(P,t) of P is non- 
vanishing for all £ 2 > with X > and < ax < 1, 
and in the case that £ 2 = for all < X ^ 1 and 
< a\ < 1. 

Proof. The discriminant of P is given by 

A(P,t) = 589824X [(1 - X - £ 2 ) 2 + 4£ 2 ] 7 

x {- [ ai (a 2 e + 1) + a 2 X]} P 2 { ail X,e) 2 

All terms except the last are obviously nonzero, so we 
take a closer look at the last term, which is a fourth 
order polynomial in X, 
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P 2 (ax,X,e) = > Y k (ax,e)X k 



fc=0 



(20) 



We are now going to show that each of the coefficients is 
non-negative and at least one of them is non- vanishing, 
giving us a positive polynomial. The coefficient 



Y4 = q 2 (16 a 2 + a\) 



(21) 



is obviously zero for ax = 1, since a 2 = 1 — ax, and 
positive otherwise. The coefficient 

Y 3 = -4(a 2 ) 2 (3al+4ax - 8)£ 2 

+4a 2 (~3al + 67aj - 196«i + 136) 

is affine in £ 2 . To show that this coefficient is greater 
than zero, we use that 



-3a? - 4ai + 8 > -3 ■ l 2 - 4 + 8 = 1 



and 



-3al + 67aj - 196ai + 136 > -3 + 66aj - 198ai + 136 
= 66(a? - 3ai + 2) + 1 = 66(2 - a x )(l - ax) > 1 

we have 

y 3 >4a 2 (a 2 £ 2 + l) >0 

with y 3 — only if a = 1. The remaining coefficient can 
be treated in a similar fashion so we will just state them 
for completeness. 
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Coefficient Y2: Y2 is a quadratic polynomial in £ 2 : 

Y 2 = 2(-13^ + 34a?-21aJ-8a! + 8)^ 4 

- 2(122ai - 636a? + 914a 2 - 384ai - 16)£ 2 

- 2 (13a? - 26aJ + 405a 2 - 392ai - 8) 

in which all terms can be shown to be non-negative. 
Coefficient Yb : The last term is given by 

Y Q = a 2 1 (l + e) 2 

x [£ 4 (a 2 ) 2 + £ 2 2(1 - a\ + 6 a x a 2 ) + 1 + a 2 + 14a x ] 
> 



a similar expansions of the vectors in a basis as in |9]) 
does not give us, as in this paper, functions of one real 
variable but gives rise to intersections of transcendental 
curves in projective space, leading us into a vast, unex- 
plored territory. 
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This shows that the discriminant is non-zero. 



□ 



To determine the number of roots we need to look 
at one polynomial for conveniently chosen parameters. 
Choose 



£ = 2, X = l, a = 



1 



giving us for ( 18 1 

P= -64(1 + i)(7 + 8t + 8i 2 



At 6 



t 4 ) 



which has exactly one real root. This completes the proof 
for mixed states. In the case that one of the states is pure, 
i.e. £ 2 = 77, the orthogonal measurement conjecture fol- 
lows from continuity in all variables of the mutual infor- 
mation. To show the stronger statement of Theorem 2 
one has to perform a similar analysis as the one done in 
this paper, with the difference, that the second derivative 



(17 1 has a maximum of three roots and one simple pole. 



In this paper we proved the orthogonal measurement 
conjecture for general two qubit states. It is natural to 
ask if this proof can be extended to higher dimensions. 
Already for qutrits we are facing serious difficulties. Two 
qutrits can in general not be transformed to real matri- 
ces with only one unitary transformation. Restricting 
ourselves to real qutrit or qunit matrices we can derive 
equations of similar type as equations (JSj . Performing 
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